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ABSTRACT 

Previous models of galactic disk heating in interactions invoke restrictive assumptions not necessarily valid 
in modern ACDM contexts: that satellites are rigid and orbits are circular, with slow decay over many orbital 
periods from dynamical friction. This leads to a linear scaling of disk heating with satellite mass: disk heights 
and velocity dispersions scale oc M sat /Mdi s k- In turn, observed disk thicknesses present strong constraints on 
merger histories: the implication for the Milky Way is that < 5% of its mass could come from mergers since 
z ~ 2, in conflict with cosmological predictions. More realistically, satellites merge on nearly radial orbits, and 
once near the disk, resonant interactions efficiently remove angular momentum while tidal stripping removes 
mass, leading to rapid merger/destruction in a couple of free-fall plunges. Under these conditions the proper 
heating efficiency is non-linear in mass ratio, oc (M sat /Mdisk) 2 - We derive the scaling of disk scale heights and 
velocity dispersions as a function of mass ratio and disk gas content in this regime, and show that this accurately 
describes the results of simulations with appropriate "live" halos and disks. Under realistic circumstances, we 
show that disk heating in minor mergers is suppressed by an order of magnitude relative to the expectations 
of previous analyses. We show that the Milky Way disk could have experienced ~ 5 — 10 independent 1:10 
mass-ratio mergers since z ~ 2, in agreement with cosmological models. Because the realistic heating rates 
are non-linear in mass, the predicted heating is dominated by the more stochastic, rare low mass-ratio mergers, 
and the existence of populations with little or no thick disk does not require fundamental modifications to the 
cosmology. This also leads to important differences in the predicted isophotal shapes of bulge-disk systems 
along the Hubble sequence. 

Subject headings: galaxies: formation — galaxies: evolution — galaxies: active — galaxies: spiral — cosmol- 
ogy: theory 



1. INTRODUCTION 

In the widely-accepted ACDM cosmology, it is expected 
that mergers and interactions between galaxies are crucial 
to g alaxy assembly beca use structure grows hierarchically 
(e.g. IWhite & Reea fl978). On this basis, it has been pro- 
posed that spheroids are produced when ga laxies merge (mak- 
ing bulges in d i sks an d elliptical galaxies; Toomr e~& Toomrel 
19721: iToomrd 1 19771). Both theory (Ostriker & Tremaine 
19751 iMaller et al.ll2006h IStewart et alj|2007t iFakhouri & Ma 



2008, and references therein) and observation (e. g. Lin et al 



2004; Woo ds et alJ l2006: Bart on et al.ll2007l Woods & Gelle 



< 10:1) likely do not destroy disks, they are almost an order 
of magnitude more frequent than major mergers and as such 
may pose an even more severe problem for disk survival. 

In the ACDM cosmology it is unlikely than any disk with 
a significant stellar age has existed ~ 5-10Gyr without 
experiencing a merger of mass ratio 10:1 or larger. Ana- 
lytic a rguments (|Ostriker & Tremainell 19751: Toth & Ostriker! 
119921; ISellwood et al.l 1 19981) and early generations of nu- 
merical simulat i ons dOuinr] 1 19841: lOuinn & G oodman! 1 1986 ; 
Hernquisdll9"89tlOuinn et al.lll993HHernquist & Mihosl ll995: 
Walker et alJI 19961 iNaab & Burked 120031) indicate that gas- 



2007b indicate that "minor" mergers of mass ratios < 1 : 10 are 
commonplace, and that a large fraction (~ 1 /2) of the ~ L* 
galaxy population has experie nced a "major" merger (mass 
ratio < 1:3) since z ~ 2-3 dBell etaD 120061: iBridge et all 
l2007irKartaltepe et aLteOOHlLotz et alj2008l;lLin et al.l2008l) . 

The ubiquity of mergers has raised the possibility that cold, 
thin disks of galaxies may not survive to z = in hierarchi- 
cal models. From numerica l simulations (e.g. lBarneslll988L 
1 1 9921: iHernquistl 19921 [19931) it has been known for some time 
that major mergers destroy stellar disks entirely and cause a 
large fraction of disk gas t o lose angular momentum through 
gravitational torques (e.g. iBarnes & Hernqui"slll991l 1 19961) 
together yielding dense, quasi-spherical bulges (at least in 
typical situations such at those at low redshift. wher e the gas 
fractions are not large; see e.g. iHopkins et afll2008l) . More- 
over, although minor mergers (at least those with mass ratios 

1 Harvard-Smithsonian Center for Astrophysics, 60 Garden Street, Cam- 
bridge, MA 02138 

2 W. M. Keck Postdoctoral Fellow at the Harvard-Smithsonian Center for 
Astrophysics 



poor minor mergers can convert a large fraction of a stel- 
lar disk into bulge and dynamically heat the disk. The ob- 
served thinness of galactic disks s u ggests that this may be 
a severe problem: iToth & Ostrikerl (1 19921) argued that large 
disks such as that in the Milky Way could not have under- 
gone a merger of mass ratio < 10 : 1 i n the last ~ l OGyr . 
Mo re recently e.g. iGilmore et al.l (120021) : IWvse et al.l (120061) 
and lStewart et al.l (120071) emphasized that the tension between 
these constraints and the expectation in CDM models that a 
number of such mergers should occur implies either a prob- 
lem with our understanding of hierarchical disk formation or 
a challenge to the concordance cosmological model. 

Given the many successes of the ACDM model and obser- 
vations that disks do experience (and survive) a large num- 
ber of mergers, it seems likely that the main uncertainty is 
our understanding of disk galaxy evolution. This has fo- 
cused attention on the problem of forming disks in a cosmo- 
logical context. A number of studies have argued that de- 
tailed modeling of gas physics, star formation, and feedback, 
along with high numerical resolution, are required to simulate 
disk formation and understand how disks survive their vio- 
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lent merger histories dWeil et al.ll 1998; Sommer-Larsen et alj 


1999, 2003; Thacker & Couchman 2000, 2001; 


Abadi et al. 


2003; Governato et al. 


2004, 2007: Robertson et all 120041 


Okamoto et al.ll2005; Scannapieco et al.ll2008l). But there has 



been relatively less focus on whether these simulations pro- 
duce disks - even where the disks avoid being converted into 
bulges - that are sufficiently thin relative to observational con- 
straints. Interestingly, more recent atte mpts to model disk 
heating in a cosmologic al context (see e.g. Benson et alJ2004t 
iKazantzidis et alj|2007l) suggest that disk heating may be sig- 
nificantly less efficient than has been assumed. 

Indeed, a closer examination of this question reveals that 
much of the conventional wisdom and the implied constraints 
on the merger history of massive disks (and the Milky Way in 
particular) is fundamentally premised on historical assump- 
tions that do not apply given what we now know about the 
A CDM framework . For example, implicit in the analysis 
of iToth & Ostriker (1992) and others is the assumption that 
the orbits of merging satellites can be described as being 
nearly circular, slowly spiraling in owing to dynamical fric- 
tion, while the satellite remains relatively intact. Early sim- 
ulations often adopted these choices - modeling the satel- 
lites as rigid point masses on almost perfectly circular orbits, 
including relatively low-mass and smooth dark matter halos 
(such that the background density is low, and the in-spiral 
of the satellite therefore relatively slow and without much 
tidal stripping), and/or using rigid (as opposed to full N-body 
or "five") background disk/bulge/halo potentials. Moreover, 
these calculations were almost always dissipationless, and did 
not allow for the role of gas in making the system more ro- 
bust against mergers and heating. Under these conditions, it 
is straightforward to show that the disk heating owing to a 
merger is linear in the mass of the satellite. This fundamen- 
tally underlies the constraint above: because the scaling is lin- 
ear, it is possible to transform an observed disk thickness or 
velocity dispersion into an integral constraint on the allowed 
merger history of a galaxy. 

Given cosmological constraints and numerical capabilities 
at the time, these were reasonable assumptions. However, 
with improvements in simulations, yielding the ability to per- 
form true cosmological simulations with resolved subhalo 
populations, and better understanding of cosmological pa- 
rameters and estimates of halo masses and sizes, it is clear 
that the assumptions underlying these early works are not 
appropriate. For example, it is now well-established that 
realistic cosmological orbits are quite radial, not circular. 
This is true on all mass scales, f rom large dark matter sub- 
structures which populate halos (|yan den Bosch et al] 1 19991 : 
Benson 2005; Khoc hfar & Burkertll2006l) to unresolved back- 
groun d velocity flows in the diffuse stellar and dark matter 
halo (ICole & La cey 1996: iAbadi et al.ll2006l) . Given that real- 
istic galaxy halos exhibit considerable substructure with large 
subhalo populations, it is generally true that systems which 
pass near a disk - those of interest for mergers and disk heat- 
ing - will be those which can transfer angular momentum and 
collide with the d isk on highly radial passages, rathe r than 
circular in-spirals dFont et al.ll2001l : lBenson et al.ll2004h . 

Moreover, properly accounting for live halos and disks 
leads to qualitatively different angular momentum transfer in 
the final stages of galaxy mergers: once a satellite reaches 
a radius comparable to the disk, resonances between the or- 
bital motion and internal motions of the disk and halo remove 
angular momentum efficiently and lead to a rapid, radial coa- 
lescence in a couple of dynamical times (rather than the large 



number of orbital times required for a dynamical friction in- 
spiral in the case of mass ratios > 1:10). Furthermore, allow- 
ing halos to have the large masses expected in modern cos- 
mologies and resolving disks with N-body approaches, with 
proper live secondaries, leads to efficient tidal stripping of the 
satellite once it begins to pass through a disk, transferring or- 
bital energy to the central bulge and extended dark matter halo 
(preferentially the hottest components of the system, rather 
than the disk). 

The historical picture of "sinking" satellites on slowly de- 
caying, circular orbits is based in part on an old view of 
the orbits of the Large and Small Magellanic Clouds (LMC 
and SMC). Recent observations, however, indicate that the 
LMC and SMC are on a high-velocity passage near peri- 
center and are only ma rgi nally bound to th e Milky Way 
( Kalli vayalil et alJ l2006allbl iPiatek et alJ l2008h . iBesla et al] 
(I2007F used these results and modern cosmological models 
of the Milky Way halo to infer their orbital motion, and ar- 
gued that they are in fact on highly eccentric radial orbits and 
are just now executing their firs t passage. Rad i al orb its are not 
uncommon in the local group: iHowlev et al] d2008l) recently 
demonstrated that NGC 205 (an M31 dwarf) is also likely 
on a radial passage near its escap e vel ocity; Leo I i s simi- 
lar (lMateoetal.ll 19981: ISohn et al.ll2007t iMufioz et al]l2008h . 
This, and the arguments above, suggest that constraints re- 
garding the merger history of the Milky Way and disk heating 
should be revisited in light of modern cosmological expecta- 
tions. 

One might think that the difference between the conven- 
tional lore on satellite heating and allowing for more realistic 
orbits would amount to a simple numerical correction - i.e. 
that the differences outlined above can be subsumed into an 
appropriate correction of the efficiency of disk heating, but 
will not fundamentally alter the scaling laws derived in the 
past (namely that disk heating is linear in merger mass). In 
what follows, we show that this is not the case. At a ba- 
sic level, the linear scaling resulting from earlier assumptions 
is pathological, and allowing for moderate deviations in line 
with what is expected in a ACDM cosmology yields a quali- 
tatively different scaling and a non-linear dependence of disk 
heating on satellite mass. This implies more than an order-of- 
magnitude suppression of disk heating in intermediate mass- 
ratio minor mergers, and essentially no contribution to disk 
heating from very minor mergers (unlike if the heating were 
linear, in which case there would be equal contributions to 
disk thickness from all logarithmic intervals in satellite and 
subhalo masses). We show that the scaling we derive, non- 
linear (second-order) in the satellite mass, is dynamically in- 
evitable and borne out in numerical simulations. 

We outline the physics of interest in § [2] explaining why 
cosmologically realistic merging orbits (even when nearly cir- 
cular initially) tend to be highly radial by the time of coales- 
cence and how this differs from the assumption of decaying 
circular orbits (§ 12. Il l, demonstrating how this change s th e 
qualitative scaling of disk heating with mass ratio (§ 12.21 ), 
deriving full analytic expressions for the disk thickness and 
vertical velocity dispersion as a function of merger and disk 
properties for the radial cases of interest (§ 12. 3b . and testing 
that these equations accurately describe the results of full nu- 
merical simulations (§ 12.41 ). We discuss the accuracy and ro- 
bustness of these approximations in § 12.51 examining when 
they are or are not appropriate. In §0 we consider the con- 
sequences of the different scalings owing to radial versus cir- 
cular orbits, in terms of the allowed merger histories of the 
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Milky Way and other massive disk galaxies (§ 13. Il l and the 
shapes of galaxies along the Hubble sequence (§ 13.2b . Fi- 
nally, we summarize our results and discuss some of their 
cosmological implications and applications to other models 
and observations in §|4] 

2. WHY DO REALISTIC ORBITS MAKE A DIFFERENCE? 

iToth & Ostrikerl ([1992) developed a simple model for disk 
heating by a merger with a satellite. They assumed that the 
satellite - of mass M%, with respect to the primary mass M\ - 
was on a circular or nearly circular orbit, which decayed by 
dynamical friction. Since the energy of an orbit with semi- 
major axis a is ~ GM\Mil 'a, the orbital energy lost by the 
satellite (which they assumed was converted into kinetic en- 
ergy of the nearby stars and dark matter, which themselves 
are the agents slowing down the orbit) in some annulus as the 
orbit decays from a — > a - da is ~ (GM\ M2 / a) (da /a) oc M2. 
Since the vertical potential for an isothermal sheet oc gH and 
the kinetic energy oc a\, given the assumption that once the 
disk relaxes it will obey the virial theorem (E = 3T, for a 
planar geometry) this implies that the local scale height H 
is proportional to the total energy deposited in that annulus 
(H oc Mi, if the disk was initially cold). In detail, they ob- 
tained that at a cylindrical radius R in the disk the average 
change in scale height owing to a merger is 

AH = ( 0.49 
R V1+0.434/V VMi[<r]/' 

where Hq = \0Hq/R ~ 1 is some internal (non-merger in- 
duced) scale height, but in any case the dependence on this 
term is weak; the primary scaling is AH /R <~ M2 / M\ . Indeed, 
simulations confirm that this scaling does reasonably describe 
the behavior of the disk in the limit of of a rigid satellite o n 
a circular orbit (see e.g. the discussion in lBenson et alj 2004), 
and that adopting the local virial theorem is a good approxi- 
mation for derivin g the scale height of the disk, given some 
energy deposition (lOuinn et al.ll 1 993h . 

2.1. How Are Real Orbits Different? 

However, a fundamental assumption here, that the orbits are 
circular and decay slowly (in a gentle circular in-spiral) by dy- 
namical friction while the satellite remains intact, bears little 
resemblance to realistic orbits in ACDM cosmologies. It is 
now known that satellites merge on highly eccentri c , nearly 
radial orbits (see e.g. [van den Bosch et al.l 1 19991; iBensonl 
120051: iKhochfar & Burkertll2006l) . Moreover, the decay of the 
orbit, while broadly governed by similar scalings such that 
e.g. the dynamic friction timescale is not a bad approximation 
(in an order-of-magnitude sense ; see e.g. Ivan den B osch et al. 
119991; IVelazquez &Whitdl 19991: iBoylan-Kolchin et al.ll2008l) . 
is usually nothing like the smooth circular decay assumed in 
lToth&OstrikelJl992l) . 

Typically, the infalling satellite and primary are deformed 
after first passage, and resonances between the internal fre- 
quencies of the systems and their orbital frequencies allow 
efficient transfer of angular momentum from the orbit to the 
halo, while the systems linger near apocenter after the pas- 
sage. They then fall in on a nearly radial orbit and quickly 
merge, with the braking of the secondary orbit occurring in a 
couple of quickly damped radial oscillations through the cen- 
ter of the primary. Even if first passage is at large impact pa- 
rameter, this process will still generally play out, but perhaps 
with one or two additional, increasingly radial, passages. At 



the same time, once the satellite approaches the radii of in- 
terest - where it is actually crossing and penetrating the disk, 
so that disk heating can be effective - it rapidly begins to be 
tidally stripped and destroyed by the gravitational interaction, 
and cannot execute a large number of orbits without losing 
some significant amount of mass (such that only the first cou- 
ple of passes are the dominant perturbation, not the sum over 
all passages). 

The radial nature of these passages in part owes to cosmo- 
logically radial orbits, but it is also an inevitable consequence 
of any orbits that will actually merge in a dark-matter domi- 
nated universe. To see why this is true, consider the nature of 
dynamical friction for a satellite at some instantaneous three- 
dimensional radius r from the primary center. Technically, 
the approximation of slow circular orbital decay owing to dy- 
namical friction, which implicitly relies upon the background 
being infinite, homogeneous, isotropic, and (in a bulk sense) 
unchanged by the motion and gravity of the secondary, is valid 
only when the enclosed mass within a sphere of radius r is 
much greater than the gravitational mass (baryons plus what- 
ever dark matter halo remains bound) of the secondary. So, 
for example, a small secondary far out in the halo (e.g. the 
common case of a satellite at large r in a massive cluster) can 
experience orbital decay owing to traditional dynamical fric- 
tion. 

However, once it reaches a radius where the enclosed mass 
is somewhat comparable to the satellite mass, this orbital de- 
cay approximation is clearly no longer appropriate (consider 
e.g. the limit of two identical point masses, one the satel- 
lite - when they are close enough that the mass of the back- 
ground halo is negligible, they would just enter into a sta- 
ble, marginally bound binary orbit, and coalesce slowly). Ef- 
fects like these are well-known even in numerical simulations 
that model t he idealized case of a r igid satellite in a smooth 
halo (see e.g. Hashimoto et al. 2003, who argue for a radially- 
varying Coulomb logarithm to match N-body results and the 
analytic prediction), although these do not include the res- 
onant effects that ultimately remove most of the orbital an- 
gular momentum. For a satellite that is small relative to the 
primary (M2/M1 <c 0.1, say), this radius may be negligibly 
small. However, for a satellite that is even a moderate frac- 
tion of the primary mass (Mz/M\ > 0.1), this will happen, for 
any reasonable mass profile, when the secondary gets near of 
order the scale length of the primary (considering the halos 
of the systems, this is really the scale length of the primary 
halo - i.e. substantially larger even than the scale length of 
the primary disk). This is precisely the radius of interest for 
disk heating. In other words, by the time the secondary is 
close enough to the primary that it can actually move through 
and heat the stellar disk, it is, by definition (for any secondary 
which is a significant mass fraction of the primary) in a regime 
where dynamical friction and circular orbital decay are no 
longer valid approximations. 

Rather, as we noted above, it is in this situation that res- 
onances between the internal velocity of the satellite and its 
orbital motion will allow efficient transfer of orbital angular 
momentum, leading to a radial infall. Because these reso- 
nances act efficiently when the internal velocities of the galax- 
ies are comparable to their orbital velocities, it is precisely 
the regime where the secondary is near the primary - where 
circular orbital decay from dynamical friction ceases to be a 
valid approximation - when these mechanisms act efficiently. 
Because internal motions (of at least some parts of the halo) 
are in resonance with those of the orbit, that dark matter is 
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strongly accelerated, removing angular momentum from or- 
bital motion far more efficiently than dynamical friction, al- 
lowing an orbit to transform from nearly circular to nearly 
radial in just one or two passages. Since the angular momen- 
tum of the orbit is transferred quickly in a passage (where the 
system spends most time near some apocenter), the satellite 
cannot gently spiral in but must return from apocenter in a 
radial plunge. For a satellite falling in on a radial orbit, it is 
effectively a free-fall at the escape velocity. The braking by 
dynamical friction on this infall is weak as the system free- 
falls (recall, it takes many circular orbits for the orbit to decay 
by standard dynamical friction). 

Rather, when the secondary finally punches through the pri- 
mary and begins to oscillate through the primary on a ra- 
dial orbit, violent relaxation (the scattering of stars in the 
secondary and in the center of the primary) off of the time- 
dependent central potential transfers energy from the radial 
orbit of the secondary to the motion of the central stellar dis- 
tribution (preferentially to whatever stars are already "hottest" 
in the center of the galaxy). This quickly damps out the or- 
bit, completing coalescence in a couple of dynamical times. 
The energy of the orbit, by this rationale, will largely have 
gone into heating the bulge of the remnant. This is both 
because the bulge dominates the stellar mass density at the 
center of the galaxy, and because the hot/spherical compo- 
nent of the galaxy (having a more isotropic orbit distribution, 
with a much larger population on radial orbits compared to a 
cold disk) is in resonance with the orbital decay of the sec- 
ondary. Some new bulge will be created by violent relax- 
at ion and heating of the central stellar populations - we show 
in lHopkins et al.l d2008l) that the population violently relaxed 
is given by the stars within a radius enclosing a stellar mass 
comparable to the stellar mass of the secondary. In short, a 
mass and energy ~ M2 is added to a classical bulge from the 
infall, not to heating the disk. 

A similar caveat applies to considering a constant-mass sec- 
ondary. Implicit in the circular orbit in-spiral approximation 
- in fact, critical to the AH/R oc M2/M1 scaling - is the idea 
that the satellite heating is integrated over a large number of 
orbits crossing a given radius r while the orbital angular mo- 
mentum gradually decays. Clearly, if the satellite is signifi- 
cantly stripped of mass and/or partly destroyed by shocking 
or tidal interactions after a few orbits, then it is only the first 
single passage or couple of passages that dominate the heat- 
ing. In such a case, modulo a geometric prefactor, the effects 
should be identical to the single radial infall or plunge sce- 
nario described above (even if the orbit is still formally per- 
fectly circular). In short, if the satellite loses mass after a 
couple of passages, then the dominant passages seen by the 
disk will be the first couple, which will heat the system the 
same whether they are circular or radial (the exact direction 
in which the secondary is traveling does not enter into the 
second-order heating below). 

Another way of phrasing this is to note that a significant 
fraction of the orbital energy is no longer lost to heating the 
disk, but carried away by the tidally stripped material (which 
could itself heat the disk in principle, but numerical experi- 
ments show that it generally is added to the halo and in more 
spherical fashion than the disk, while retaining its coherent 
orbital motion rather than heating nearby material, and more- 
over because the material is, essentially by requirement in 
order to be stripped, thin and diffuse relative to the poten- 
tial nearby, it cannot effect significant heating in the manner 
of the entire satellite if it moves through the disk; see e.g. 



iKazantzidis et al.ll2007l) . Moreover, this process can acceler- 
ate the orbital angular momentum loss, by shedding angular 
momentum from the orbit preferentially in the stripped mate- 
rial (there is a general tendency for angular momentum trans- 
port to the least-bound material in interactions). 

As was the case above for the resonant interactions that re- 
move angular momentum, we can roughly derive the radius 
where stripping is expected to become important, and find 
(unsurprisingly) that it is again precisely the radius of inter- 
est, i.e. the radius where the secondary begins to actually in- 
teract with and cross the primary disk (since the primary disk 
is much more dense than its extended halo, this will almost 
always be the case). Even ignoring the more dense bary- 
onic components, if we model the secondary and primary as 
iHernquisl (fl990) profile spheres with effective radii a oc M 1 / 3 
(as expected and observed for bo th halos and disks; see e.g. 
iMo et al] [19981: IShen et alJ 120031) . then calculate the formal 
tidal radius f = ri_ja2 within the secondary as a function of 
the separation x=r/a\, the dependence on mass ratio cancels 
out entirely and we have r (1 + r) 2 = (1 +x) 3 /2 - i.e. material 
within a fraction w 0.8 {r/ci\) times the effective radius of M2 
will be stripped when the satellite is at a distance r relative 
to the effective radius a\ of the primary. In short, precisely 
the time of interest, when the satellite enters the effective ra- 
dius of the primary, is when th e satellite begins to be rapidly 
stripped (for a more details, see lBenson et al.ll2004h . 

2.2. How Much Heating Arises From Realistic Orbits? 

How much disk heating is effected in this process? There 
are several ways to cast this problem, but all yield the same 
behavior. For example, we can even obtain the correct scaling 
by considering a pure dynamical friction derivation, despite 
the caveats above, so long as we acknowledge the more real- 
istic nature of the orbits. For simplicity (ignoring temporarily 
the caveats just given) let us assume that the secondary moves 
through an isotropic homogeneous background density, with 
(for analytic convenience, although it makes little difference 
to the end scaling) an isothermal sphere profile. The instan- 
taneous effective dynamical friction force on the secondary 
is 

C A/ 2 

F df = -0.4281nA^-v, (2) 

r L 

where In A is the appropriate Coulomb logarithm (we will 
suppress this for now, but including it is comparable to e.g. the 
uncertainties owing to details of the profil e shape, and numer- 
ical experiments suggest In A k, 1—2; seelOuinn & Goodman] 
[19861: 1 Velazquez & Whitdfl999l;lBovlan-Kolchin et alj|2008h . 
The force is, by definition, in the direction of the secondary 
velocity vector v (and opposed to the velocity, hence being 
negative). If the secondary travels a distance ds, then, an ef- 
fective work is done by dynamical friction involving a transfer 
of energy from the orbit of the secondary to the motions of the 
stars/dark matter that define the background, with magnitude 
F d fds=|F df ||ds|. 

If the satellite is on a circular orbit, then this motion ds 
is largely in the azimuthal direction, and the radius r decays 
slowly. The tangential dynamical friction force gives a rate of 
change of specific angular momentum L of dL/dt = F^r/Mz- 
Since for a circular orbit L = rV c (where V c is constant for 
the isothermal sphere), and ds = V c dt, we can rewrite this as 
dr = -0.428 (GM2/V 2 r)ds, if the satellite remains constant 
internally throughout the orbit(s). Since the enclosed mass 
within r, M enc = V?r/G, this is dr = -0.428 (M 2 /M enc )ds. In 
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order for the circular orbit to decay in radius by dr, then, a 
work {GM2M enc /r)(dr/r) must be done. S i nce M i ks M enc , 
we have just recovered the Toth & Ostrikerl d 19921) formula, 
for the specific case of an isothermal sphere. The imparted 
energy in the annulus dr is oc M\ M2, so the relative heating 
(i.e. imparted energy relative to the characteristic energy of 
stars at r in Mi, ~ GM\/f) goes linearly in the mass ratio, 
oc M2 /Mi . In short, even though the instantaneous force oc M\ 
as the secondary orbits, it takes ~ {M\/M2)dr/r orbits to de- 
cay in radius dr, so the total energy imparted in an annulus 
scales oc M\ M2 (note that this is why it is important that the 
satellite remain intact in this derivation, since we have implic- 
itly assumed that the same mass M2 makes ~ M1/M2 cross- 
ings of the disk). 

The situation is trivial for a radial orbit. Here, ds = dr, so 
the work done as the secondary falls inwards by dr is just 
0.428 (GMj/r) (dr/r). The imparted energy oc M\ , so the rel- 
ative heating is quadratic in the mass ratio, oc (M2/M1) 2 . Note 
that this is valid in an instantaneous sense (i.e. with M2 mea- 
sured as the system enters the radius r) even if the system is 
being stripped or destroyed with each encounter in the disk. 

We could instead treat the problem entirely differently, and 
obtain the same answer. For example, assume instead that we 
face the opposite regime, where the orbit punches through the 
disk rapidly instead of being gently dragged, in which case 
the impact approximation is reasonable. Moreover, this could 
be used to describe the response of the disk at large r to the 
damping radial oscillations as the secondary rapidly merges 
in the center. In either ca se, a full impact app roximation so- 
lution (derived following iBinnev & Tremainel ll987l) follows 
for circular orbits and a rigid satellite, assuming the impact 
parameter b = r and impact velocity V, ~ V c , and allowing for 
a number of independent passes ~ M\ /Mo as required by the 
energetics for the circular orbit to decay. This yields the same 
result, that the energy imparted at r goes as oc GM\M2/r. 
For a radial orbit (or allowing for satellite destruction), we in- 
stead assume a penetrating encounter (the radial limit b = 0), 
with a rapid decay (technically we can fully solve with some 
exponential radial decay time or stripping/destruction time 
t oc fdyn, but in any case this only is relevant for the numerical 
prefactor), and we obtain the same scaling as above, with the 
energy imparted oc GM\ / r. The details of our derivation only 
change the exact numerical coefficients. 

It is easy to show that these scalings more or less have to be 
true. For circular orbits and a rigid satellite, as described at 
the beginning of this section, one can simply take the differ- 
ence in energy between a circular orbit at radius r and one at 
radius r + dr, which is (GM1M2/ V)(dr / V) - this energy must 
go somewhere, so modulo the degree to which it is spread 
over the disk (i.e. the Coulomb logarithm), the scaling must 
be similar. For a radial orbit, there is no difference in orbital 
energy in a single infall at r and r+dr, and even in the circu- 
lar case only the first couple of passages matter, so the orbital 
energy loss is negligible (it takes many circulations for the or- 
bit to decay). The energy exchange must instead come from 
the deceleration from the wake or interaction of the secondary 
with the extended primary - i.e. the back-reaction of the sur- 
rounding material from the acceleration imparted by M2 itself. 
This effect is necessarily second-order in M2 (and arises only 
because the real distributions of material are extended, rather 
than point sources). So long as the radial orbit decays quickly 
once it actually penetrates the galaxy with small impact pa- 
rameter, there will not be some large number of passages at 



large r that could change this (and it is almost impossible 
to arrange a realistic stellar distribution that will not quickly 
damp a penetrating radial orbit once it passes through the cen- 
ter of the system). 

2.3. A Re-Derivation of Disk Heating for Realistic Orbits 

Here, we present some useful equations for disk heating 
o wing to more real i stic or bits. Most of the details are derived 
in lToth & Ostrikerl (| 19921) . The key quantity that differs, de- 
pending on the nature of the orbit, is the energy deposition 
or exchange in an annulus of width dr. It is convenient to 
express this in terms of the absolute value of the energy ex- 
changed between orbit and background (i.e. the work done 
W, or the energy lost in the orbit) per unit area, Ae = dW/dA. 
For an isothermal sheet potential (a reasonable approximation 
in a sufficiently local patch of the disk), the potential (in the 
vertical direction) is proportional to the scale height H times 
the local surface gravity and the vertical kinetic energy is just 
dMa 2 /2. If the system obeys the local virial theorem then 
E = T + U = 3T, and it is straightforward to calculate the heat- 
ing from some energy input Ae: 

2 Ae 
H (r) = T- 7^ (3) 



^TiG^rYd + ^r)) 



3£ d (r)(l + ^(r))' (4) 

Here, SdW is the disk surface density at r and the term 
(1 +rjh{r)) is an order unity (generally small around R^, the 
scale radius of the disk) correction for the disk-halo gravity. 
That these are good approximations to full numerical results 
(e.g. that the system obeys the local virial theorem and that 
the isothermal sheet approximation is re asonable) has been 
confirmed in a number of experiments dOuinn et al] 1 19931 
| Walkeretal]|1996t IVelazquez & White! 119991; iBenson etal 
120041) . 

Based on our derivation above, we expect for a radial or- 
bit moving through a background of density p that Ae = 
(Fdfdr)/(2ir rdr), giving 

G 2 M 2 p 



i 



At 



(5) 



in full generality, where a is the appropriate constant depend- 
ing on the Coulomb logarithm and the isotropy of the back- 
ground velocity distribution. 

Up to this point, we have made no distinction between a 
dissipationless and dissipational disk. If there is gas in the 
disk, it can radiate energy efficiently. For simplicity, assume 
the gas is distributed in the same manner as the stars in the 
disk with gas fraction / gas = M gas /(M gas + M„), and that the 
energy correspondingly couples evenly to the gas and stars 
(i.e. a fraction / gas couples to the gas). If the gas can effi- 
ciently radiate that energy, then the coupled energy of interest 
(that which will not just radiate away) is Ae e ff = (1 - /gas) Ae. 
Given this effective energy injection, our previous equations 
for H(r) and a 2 are unchanged (they will change only on the 
2-body relaxation time of the local stellar disk, as energy can 
be re-exchanged between individual stellar orbits and the thin 
gas disk, but this timescale is long relative to the heating of 
interest). 

For simplicity, assume the halo is an isothermal sphere, 
such that the circular velocity is approximately constant as 
a function of radius. This then reduces to 



A-f (1- 



/gas) 



GM 2 
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and 

^- = a//(l-/gas)(^) 2 h(R/Re,6isl) (7) 
«c,disk ^d 7 

Ad, 2 /M 2 \ 2 

Vc.disk v ™ d / 

where a« w 1-2 and a ff 0.3-0.6 are the appropriate nor- 
malization constants and h(R/Rd) ~ s(R/Rd) ~ 1 are weak 
functions of 7? (for convenience, we define them so that 

fc(l) = iU) = l). 

In detail these quantities will depend on how the orbital de- 
cay proceeds as a function of radius. For the precise case 
derived above on a purely radial orbit that decays instantly 
at r — > (and isotropic heating), we obtain if the disk is a 
iMestell d 19631) disk (constant circular velocity V c ,disk, where 
we define v = (Vc,disk/Vh) 2 ~ 1 relative to the halo circular ve- 
locity, and S d oc \/R) a H = 1.60vand h(x) « [0.7x + 0.3 v]" 1 
(giving a roughly constant AH ove r the radii of interest, sim- 
ilar to what is observed; see e.g. IdeGrijs&Peletieri 119971: 
iBizyaev &"M itronova 2002), with a a = 0.57v, s(x) = x~ 2 . For 
an exponential disk (with V c .disk defined as the maximum 
circular velocity of the disk) we obtain an = 0.68 v, h(x) = 
[2x 2 exp(-jt) + 0.26 v] -1 and a a = 0.32, s(x) = x~ 2 . 

However, more realistically, the normalization here reflects 
a sum over various passages as the complex resonant in- 
teractions described above rapidly remove angular momen- 
tum from the secondary. At the same time, tidal stripping 
is continuously removing mass, and the gravitational mass 
of the primary can effectively appear different depending on 
where in the potential the secondary is located. It is possi- 
ble to develop a much more complex analytic model that at- 
tempts to account for these effects, including e.g. some model 
of dynamical friction, stripping, disk-satellite shocking, and 
then time-integrating these eq uations to produce the numeri- 
cal constant desired (see e.g. iBenson et al. 2004). Although 
these offer some improvement, it is still unfortunately not 
possible to follow the proper sources of angular momentum 
loss (resonant coupling in the orbit) and back-reaction of the 
halo and secondary in fully analytic form, and because of this, 
these more sophisticated treatments do not yield a major gain 
over our much simpler scalings above. 

Fortunately, testing these predictions in full numerical sim- 
ulations (see § 12.41 below) - including live halos and secon- 
daries, systems with a huge number of very small satellites, 
and gas-rich disks evolved in full hydrodynamics - we find 
that the scalings above nevertheless accurately describe the 
disk response, with a simulation-to-simulation scatter of only 
a factor ~ 2 — 3. Calibrating the normalization of the effi- 
ciencies properly to N-body results, we obtain an ~ 1.0-2.0 
and a a w 0.3-0.6, similar to the coefficients estimated an- 
alytically. We find that these calibrations are robust across a 
wide range of simulations and mass ratios, and describe the 
behavior of AH / R and Aa 2 /v 2 well from ~ 1 - 5 Rd (outside 
of which the disk is fairly ill-defined, both numerically and 
physically, so we are not surprised that the scalings do not 
trivially extrapolate to infinite R). 

2.4. Numerical Tests 

In order to test how robust these scalings are, we compare 
them with N-body simulations. We compile results from a 
number of different systematic studies of mergers, in order to 
see whether the particular choices in a given study are patho- 
logical. 



IVelazquez & White! d 19991) consider collisionless mergers 
of an N-body disk+bulge+halo Milky Way-like model pri- 
mary (although the system is collisionless, a nd so can b e 
freely rescaled to arbitrary mass) with N-body iKingl (119661) - 
model (truncated isothermal sphere) secondary satellites of 
varying concentration and mass ratios 1:5 and 1:10. The au- 
thors perform a survey of orbital inclination (from prograde 
9 = 0° to retrograde 9 = 180°) and impact parameter (equiv- 
alent to varying the initial eccentricity or circularity of the 
orbit, with ej = J/Jc(E) being the ratio of the orbital angular 
m omentum J to that of a circ ular orbit Jq varied). 

IVillalobos & Helrnl (120081) consider dissipationless merg- 
ers of a z = Milky Way-like primary and a scaled z = 1 
analog, with disk and satellite prop erties scal e d acc ording 
to theoretical expectations following iMo et al. (1998) (note 
that this may in fact over-estimate the redshift corrections, 
making their z = 1 cas e more appropriate for z ~ 2-3; see 
ISomerville et al.ll2.Q08b . They model the primary halo as adi- 
abatically contracted, and merge it with either bulge+halo or 
disk+halo satellite models of mass ratio 1:5, on varying orbits 
w ith a moderately radia l impact parameter. 

lYounger et al.l d2008l) consider full hydrodynamic merger 
simulations of two bulge+disk+halo systems, with mass ra- 
tios 1:8, 1:4, 1:3, and 1:2. In each case, the orbital inclination 
is surveyed, and two sets of models are run: one with a rel- 
atively low gas fraction ~ 0.2 in the disk at the time of the 
merger (corresponding to Milky Way like disks over the last 
few Gyr) and one with intermediate gas fractions ~ 0.4 (cor- 
responding better to intermediate redshift z > 1 cases). Most 
of the orbits are parabolic and radial (b ~ 1 -2 /?</), but the 
authors consider limited surveys in the impact parameter and 
corresponding circularity /initial angular momentum of the or- 
bit (from b = 0- IQRd)- Because, for dissipational systems, 
the gas properties set an absolute units scale, the physical 
mass and size scale of the galaxies cannot be freely rescaled: 
the authors examine primarily a Milky Way-like primary, but 
also consider a limited mass survey spanning a factor ~ 10 in 
lo wer primary mass es. 

Havashi & Chibal (|2006) investigate dissipationless merg- 
ers of a disk+bulge+halo primary (including both rigid ha- 
los+bulges and restricted halos free to move but not real- 
ized in N-body fashion, with an N-body disk; note that 
IVelazquez & Whita d 19991) argue that these restricted halo 
models may artificially inflate the resulting scale heights by 
a factor ~ 2, but this is comparable to the scatter in any 
case). The halo is merged with a large number of satel- 
lites (subhalos), drawn in Monte Carlo fashion from a mass 
spectrum with power-law index dN/dM oc M~ 2 , similar to 
the sub-Milky W ay end of the halo mass function (see also 
lArdi et all I2003L who perform a similar exercise and reach 
similar conclusions). A mass fraction =0.1 of the primary 
is placed into satellites according to that mass distribution, 
with a t heir spatial distri bution varied systematically (fol- 
lowing a iHernquisl d 19901) profile), and their initial velocity 
distribution determined assuming they follow the local halo 
velocity ellipsoid with either an isotropic dispersion tensor 
((3= 1-0.5 (ex 2 , + cp£) I a 2 = 0) or a radially anisotropic disper- 
sion (J3 = 0.5). In addition to their spatial and orbital velocity 
distributions, the absolute number of subhalos and maximum 
and minimum masses sampled are varied, and both point- 
mass subhalos and extended-mass (N-body) subhalo models 
are considered. 

FigurefUplots the results of these experiments. Specifically, 
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FIG. 1. — Analytic predictions for disk heating in mergers and results 
of numerical experiments. Top: Disk heating (change in scale height at 
R Kb R e ) AH(R)/R versus merger mass ratio M%jM\, Contrast our pre- 
diction AH oc (M2/M1) 2 (Equation (7) ; black lines show a factor 2 scat- 
ter) and that of Toth & Ostriker ( 1992) for rigid satellites on circular orbits 
(AH oc M2/M1; orange dotted). We compare N-body simulations of disk- 
satellite mergers of various masses (labeled). Each simulation set surveys 
orbital parame ters (inclination, impact par ameter, angular momentum). The 
dissipationless Velazquez & White 11999) simulations also survey satellite 
mass p rofiles and concentrations (c); the dissipationless Villalobos & Helmi 
12008) simulations scale systems to z. = and z = 1 , and consid er both disk 
and spheroid satellites. The hydrodynamic Younger et al. (2008) simulations 
survey disk gas content (disk heating scales oc (1 — /gas); to factor out this 
dependence, we plot these points as AH /R X 1/(1 —/gas))- Middle: Heating 
versus the linear sum over encounters in Havashi & Chiba 12006) simula- 
tions. They consider simultaneous mergers with many (~ 100—1000) small 
satellites (M2/M1 ~ 0.001 —0.1). Each point is the net heating, integrated 
over all encounters, for different initial configurations. If heating were lin- 
ear in Mi /Mi, there should be a linear correlation between the net heating 
and the linear sum over mergers, EiV,- (M,/M2). Bottom: Heating versus the 
quadratic sum over all encounters, as predicted by our Equation {7}. Numeri- 
cal experiments confirm our predicted scaling with factor ~ 2 - 3 scatter, and 
demonstrate that the scaling of disk heating is second-order in MijM\ . 



we compile (for the experiments described) the resulting scale 
height of the disk after the merger, at some radii ~ 1 —ARj (i.e. 
within a factor of 2 of R e , where we do not expect a strong 
dependence of H on R). In all cases, the authors evolve cor- 
responding isolated disks, so the value we consider is AH, 
the change in scale height relative to an isolated disk (i.e. sub- 



tracting the effects of internal heating). We convert all the re- 
sults to a uniform definition in order to compare them in a fair 
manner: we define AH precisely as the median scale-height 
of stars in the annulus dR (i.e. exactly half the disk stars in 
the cylindrical radial annulus dR should lie in z = ±H), and 
obtain AH as close to the disk half-mass radius R e as possible 
(but in any case rescale all the results to AH(R)/R for the R 
measured). 

Firs t, we consider the simul ati ons fr om lVelazciuez & White 
1999), IVillalobos & Helmil (120081) . and lYounger et al. 
2008), who consider idealized single encounters (as opposed 
to large numbers of multiple or simultaneous encounters). We 
compare AH(R)/R as a function of the mass ratio of the en- 
counter /1 = M2/M1, for the v arious simulation s , to t he an- 
alytic predicted scalings from iToth & Ostrikerl (1 1992b (lin- 
ear injx)_jind_our derivation herein (second-order in /z). For 
the lYounger et al.l d20 08) hydrodynamic simulations, we con- 
firm the predicted scaling AH oc (1 -/ gas ) (and a more lim- 
ited subset shows that this applies reasonabl y well over a 
system atic study varying / gas = - 1 ; see e.g. iHopkins et al.1 
2008). To compare uniformly with the other (dissipation- 
less, /gas = 0) numerical simulations here, we actually plot 
AH /R x 1/(1 -/g as ), so that all simulations are on the same 
footing. For each set of simulations and mass ratio, there is 
a factor ~ 2 - 3 scatter owing to a combination of orbital pa- 
rameters (both orbital inclination and impact parameter) and 
the detailed choices for the structural properties of the galax- 
ies (recall, the simulations plotted make various and in some 
cases quite different assumptions about the shapes of sec- 
ondary galaxies and their mass profiles). In any case, the scat- 
ter is still reasonably small, and despite the fact that these sim- 
ulations include e.g. the more complex realistic orbital struc- 
ture, tidal stripping, and other effects, they agree quite well 
with our predicted scalings for the range in normalizations 
a H w 1-2. 

If we were to consider arbitrary normalizations a# (i.e. ig- 
nore the analytic expectations entirely), then for any single 
encounter, there is of course a formal degeneracy between 
the dependence of resulting scale height on mass ratio and 
the efficiency (implicit in an)- Even for the set of simu- 
lations just described, the dynamic range in /1 is relatively 
small and scatter sign ificant, so if we freely renormalized the 
IToth & Ostrikerl (Il992h prediction, we could find a solution 
that was not a particularly bad match to the numerical results 
(albeit still not as good a fit as the predicted quadratic depen- 
dence, even with a fitted a# that differs from the analytic ex- 
pe ctation by almost an ord er of magnitude). The simulations 
of lHayashi & Chibal (120061) . however, include the effects of a 
large number of encounters with a wide range of mass ratios 
(~ 2 orders of magnitude in /i), and can strictly break this 
degeneracy. 

If the true scaling of AH /R were linear in /1, regardless of 
its normalization, then, in a sum of /V, encounters with vari- 
ous systems of mass ratios the net heating should be lin- 
ear in the sum of these ind ividual events (modulo some scat- 
ter), i.e. AH/R oc EJV,- /z,-. lHavashi & Chibal d2006l) consider 
this, and we show their results (renormalized appropriately 
for our definitions) in Figure Q] Note that because some of 
the orbits they consider are very long axis and/or energetic, 
a given satellite (especially in the cases where they use point 
mass satellites, since there is no stripping/destruction of the 
satellite) can make many well-separated passes through the 
disk. The appropriate sum is then over each encounter, treat- 
ing them separately with /1, appropriate for the mass ratio at 
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each encounter that penetrates the disk (for this reason, with 
some satellites making many passages, the sum SiV; /i; can be 
> 1, despite the fact that the total subhalo mass in these sim- 
ulations is only a fraction of the primary mass). Each point 
in the figure represents the sum over many small mass ra- 
tio encounters with one of the initial configurations described 
above, typically mergers with ~ 1000 satellites spanning a 
mass spectrum from /i, ~ 10~ 3 - 10 _I . In any case, the result 
is essentially a scatter plot, with no significant correlation vis- 
ible. 

If, instead, the appropriate scaling goes as /i 2 , as we have 
predicted, then the net heating in this ca se should behave 
as oc E jV;M?- Considering the results from Haya shi & Chibal 
( 2006) in this comparison, we do indeed find a tight linear cor- 
relation (as the authors themselves note). In short, the heat- 
ing from each of the many encounters is clearly adding 
quadratically - i.e. the effective heating per event scales oc [if, 
not linearly. Because this is a mix of mass ratios, project- 
ing versus SA^/i, degrades the correlation increasingly the 
more the merger history is varied. We also reproduce the 
simple r merger history simulation s fro m I Velazquez & White 
(fl^ . lViUalobos & Helmil d2008l and lYounger et alJ (120081) 
in this plot, with our analytic expectations. For these single 
encounters, we take Nj = 1 with /if being the initial mass ra- 
tio, appropriate for the fairly idealized mergers these repre- 
sent (although they vary other properties not considered in the 
lHayashi & Chibal (|2006) simulations), a reasonable approxi- 
mation also for the reasons below. 

2.5. How Robust Is This Scaling? 

Given the complexity of the full time-dependent behavior 
in these simulations, and the wide variation in the physics in- 
cluded and assumptions made by the different authors, it is 
remarkable that the agreement is as good as it is in Figure Q] 
Why should our simple scalings (Equations |7]|8]l describe the 
full N-body results as well as they do, with only a scatter of a 
factor ~ 2-3 in AH/R and no dramatic systematic offset be- 
tween different simulations? There are in fact several reasons 
for this. 

Most important, despite the fact that there is considerable 
complexity in the orbital decay of satellites, is that once it be- 
gins to experience close passages and penetrating encounters 
with the disk, the satellite is efficiently tidally stripped and 
destr oyed within a couple of passages (see e.g. lOuinn etal.1 
U993tlVelazquez & Whitdll999tlBenson et al.ll2004l) . The re- 
sult is that the approximation of a single "plunge" with in- 
stantaneous decay/destruction is quite good. Furthermore, the 
tendency of resonant interactions between disk, halo, and sub- 
structure to efficiently remove angular momentum also serves 
to make final passages more radial, removing some of the de- 
pendence that might otherwise result from initially different 
orbital configurations. Critically, all of these modern simu- 
lations include live disks and massive dark matter halos. In 
addition, differences between the effects of individual pas- 
sages are averaged out in realistic cosmological ensembles 
with many encounters. 

These numerical experiments also allow us to ask the ques- 
tio n of how circular or rad ial a given orbit should be for either 
the lToth & Ostrikerl(ll992l) derivation or ours, respectively, to 
apply. Clearly, given the variation in orbital parameters of 
the cases shown in FigureQ]and agreement of the results with 
our predicted quadratic dependence on \i, the radial case de- 
rived here, where the imparte d energy oc (M 2 /Mi) 2 , i s gen- 
eral, and the circular case in lToth & Ostrikej] (1 19921) . with 



impart ed energy oc (M 2 /M i ), is p athological. The encoun- 
ters in IVelazquez & White! (1 19991) span a range in circular- 
ity fr om J/Jc(E) = 0.3 - .8, and the impact parameter study 
from lYounger et all d2008l) covers ~ 0.1 — 3 in the ratio of 
the angular mome ntum to tha t of a circular o rbit. Moreover, 
lHayashi & Chibal (120061) and lArdi et all (l2003h study full orbit 
distributions, both isotropic and moderately radially biased, 
which should include large populations of initially more tan- 
gential orbits. In all cases, the end results still scale more gen- 
erally with the predicted quadratic oc (M 2 /Mi) 2 rather than 
lineai- oc (M%/M\). 

It is easy to see why this must be the case. Recall, the in- 
stantaneous work done by the secondary as it moves some 
distance ds is oc M\ As (second-order in M 2 ). The case of rigid 
satellites on circular orbits yields a linear scaling in M 2 be- 
cause the number of orbits required precisely cancels one or- 
der in M 2 - i.e. in order for the orbit to decay inward by dr, 
it must execute ~ (Mi/M 2 )(dr/r) orbits. So the net heating 
as it moves through the annulus is the combination of these 
two factors: M 2 x (Mi /M 2 ) ~ M\ M 2 . This requires a rather 
precise cancellation - if any effect can upset this - i.e. if for 
some reason the large number of circulations is suppressed, 
and in fact the satellite does not execute ~ (M1/M2) identical 
orbits for each unit radius it decays - then the scaling will just 
go as M\ . 

Such a suppression is almost inevitable: first, as discussed 
above, a realistic extended secondary will rapidly lose mass 
and eventually be tidally destroyed as it executes such circu- 
lations. In practice, the secondary, especially in e.g. a 1:10 
case where it would require ~ 10 full orbits through each 
radial interval to yield a linear scaling, never survives more 
than a couple of penetrating passages without losing a sig- 
nificant fraction of its mass. The subsequent passages are 
exponentially suppressed as the secondary loses mass, and 
a proper re-derivation of the heating gives an energy trans- 
fer rate that goes like M\ [ 1 + exp (-Mi /M 2 ) (Mi /M 2 )]- so the 
leading term is similar to our estimate for a single passage. In 
terms of the instantaneous heating, it does not matter whether 
the orbit is radial or circular - our derivation is really more 
broadly applicable to all cases where a very long (secondary 
mass-conserving) in-spiral i s not possible. 

Second, as discussed in § 12.11 the system does not, in fact, 
need ~ Mi /M 2 circulations to lose a significant fraction of its 
angular momentum; N-body simulations including extended 
systems typically find that angular momentum exchange in 
resonances (and mass loss) makes only one or two close pas- 
sages sufficient. 

Third, in systems with a large number of bodies (halo sub- 
structure), there tends to be efficient transfer and segregation 
of angular momentum. The satellites on circular orbits will 
preferentially occupy relatively large impact-parameter orbits 
in the o uter regions of th e halo, not coming close to the disk 
(see e.g. lFont et alj200"fl) . The systems that will have encoun- 
ters with the disk are on radial orbits, and so can be considered 
a series of independent highly radial passages, rather than a 
continuous circular heating. It is difficult to obtain circular 
orbits at small radii, and usually involves invoking slow tan- 
gential decay of the initially circular orbits at larger radii ow- 
ing to dynamical friction (a slow process, with characteristic 
timescales of order the Hubble time). 

3. CONSEQUENCES OF THESE ORBITS 

Having established that disk heating owing to mergers with 
somewhat radial orbits or efficient stripping is second-order 
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in Mi/ M\, whereas heating owing to mergers with rigid satel- 
lites on circular orbits is first-order in Mi/M\, what are the 
consequences of this? 

3.1. The "Allowed" Merger History of the Milky Way and 
Massive Disks 

First, consider what this means for the allowed merger his- 
tory of a given galaxy. Modulo the numerical prefactor, a rigid 
satellite on a circular orbit of mass ratio /i = M2/M1 will heat 
the disk to H/R ~ /i. For a 1:10 merger on a circular orbit, 
for example, this translates to H/R ~ 0.1, comparable to val- 
ues observed, setting a relatively stringent limit on how many 
mergers of even this small a mass ratio could have occurred 
without overheating or destroying a massive (Milky Way-like) 
stellar disk. However, if (more realistically) the same merger 
were to occur on a radial orbit and/or stripping were efficient, 
the heating would be suppressed by an additional factor ~ /1, 
giving H/R ~ /j 2 . For the 1:10 merger, then, this is an order- 
of-magnitude suppression in the disk heating! The previous 
constraints in fact would allow quite a large number of high 
mass-ratio (or even a few fairly low mass-ratio) mergers with- 
out problems for disk survival. 

Figure [2] re-examines the constraints on the Milky Way 
disk in light of this important distinction . Various observa- 
tions dWielenlll977t ICarlberg et al.lll985t lOuillen & Garnettl 
|2001t |Nordstrtim et al. 2004; Seabro ke & Gilmord I2007D 
have quantified the vertical velocity dispersion of stars in the 
solar neighborhood as a function of the mean stellar popula- 
tion age r. Given the circular velocity V c ~ 220 kms" 1 and one 
of the models for disk heating described in § 12.31 above, this 
can be translated to an upper limit on the allowed merger his- 
tory. Note that some of the observed dispersion could owe to 
purely internal effects - e.g. heating by two-body relaxation 
and scattering off of molecular clouds, spiral structure, and 
other local density perturbations - so assuming that the heat- 
ing comes entirely from mergers as given by the equations 
above is an upper limit to the allowed merger history (techni- 
cally gas complicates this comparison somewhat, since it can 
dissipate, but as we discuss below, the effect is not large and 
further supports our argument that the merger history could 
be much more violent than has been previously assumed). 

In detail, merger histories could be complex and mix many 
different mass ratios - however, in order to show the qualita- 
tive result in illustrative fashion, we convert each value cr(r) , 
compiled from the observations of Nordstr 5m et al.1 (|2004), 
to a maximal number of 1:10 mergers assuming each merger 
is exactly 1:10, that there has been no new accretion in the 
meantime (again, including such a term would allow even 
more violent merger histories), and that each merger heats the 
disk according to Equation (0. We plot the resulting "maxi- 
mum number of 1 : 10 mergers" as a function of lookback red- 
shift corresponding to each stellar age r, assuming either ra- 
dial/stripped orbits or rigid satellites on circular orbits in our 
conversion. We compare this to the median (and ±1 a range) 
number of mergers of this mass ratio or lower (more massive) 
integrated from z = to the corresponding lookback time r 
estimated from the dark matter merger rates as a function of 
mass and redshift, for halo masses similar to the Milky Way 
(~ 10 12 M Q , altho ugh the expec t ation is onl y weakly mass- 
depen dent), from IStewart et all (120071) and IFakhouri & Mai 
(2008) (note that for a given definition of merger mass ra- 
tio, the two calculations give answers in agreement to within 
~ 20%). 

The answer is as expected from above: if the merging orbits 



were all circular and satellites were rigid, transfer of orbital 
energy to disk heating would have been efficient, and the lim- 
its are quite stringent (significantly below the expected num- 
ber of mergers). On the other hand, if the mergers were ra- 
dial (i.e. most of their energy went into violently relaxing and 
heating only the central bulge stars) and/or stripping acts effi- 
ciently after a couple of passages, then a much larger number 
of mergers are permitted - in fact significantly more than are 
cosmologically expected in typical systems (allowing some of 
the observed heating to stem from internal processes, and re- 
flecting the fact that some - albeit not most - orbits will be 
more circular than others and yield somewhat more heating). 

Similarly, we can convert the observational constraints into 
a maximum allowed mass ratio merger for each stellar pop- 
ulation age (assuming the entire heating owed to a single 
merger, and using Equation to estimate the correspond- 
ing mass ratio for either a circular/rigid or radial/non-rigid in- 
teraction). Figure [2] shows these results as well, compared 
to the median expected maximum mass ratio merger since 
lookback time r (and ±ler range). The dramatic qualita- 
tive difference between orbits is again present. Regardless 
of how we represent merger histories, the constraints on disk 
heating/thickness easily allow for the expected cosmological 
merging histories if the orbits are primarily radial and/or satel- 
lites are non-rigid (with enough cushion that even if there is 
internal heating and the occasional circular orbit, there is still 
no significant tension). 

Note that while our derivation more than allows for the frac- 
tion of > 1:10 mergers in total, there is an intersection of our 
constraints with the maximum mass ratio merger around mass 
ratios ~ 1:4-1:3. This, of course, is expected. Roughly half 
of the observed galaxy population with the same mass as the 
Milky Way are bulge-dominated (SO or elliptical) galaxies, 
with most of this mass assemble d since z ~ 1 — 2 (see e.g. 
iBell et al.ll2003HBorch et~alll2006l) . We would expect that the 
Milky Way, being still disk-dominated, would therefore be be- 
low the median (in short, the half of galaxies that have had a 
~ 1:3 merger since z ~ 2 probably correspond to the half of 
galaxies at these masses that are ellipticals). Allowing for this 
fact (or equivalently comparing the distribution of expected 
mergers in systems that would still be disks today), there is no 
tension here (the allowed history for the Milky Way is consis- 
tent with that expected for fully ~ 1/3 of galaxies of similar 
mass, or roughly ~ 80% of the systems that will still be disk- 
dominated at z = 0). Furthermore, the constraints are even 
less stringent if we allow for gas accretion and/or subsequent 
growth of the galactic disk after early redshifts. 

This is quite generally applicable - not just to the Milky 
Way disk, but to the po pulation of observed edge-o n disk 
galaxies. For example, Dal canton & Bernst ein (2002]) esti- 
mate the age-scale height relations for a number of observed 
edge-on disks of various masses. The median trends are simi- 
lar to those implied by the Milky Way age-velocity dispersion 
relation (and in fact if we were to convert their estimates in the 
same manner in Figure |2] the resulting allowed ranges would 
lie within the same shaded range as estimated for the Milky 
Way, albeit with larger error bars). 

3.2. The Edge-On Shapes of Galaxies Along the Hubble 
Sequence 

Since both disk heating and bulge formation proceed from 
mergers, the differences above naturally predict a different 
relation between e.g. disk thickness and bulge-to-disk ratio 
along the Hubble sequence from very late type disks to bulge- 
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FIG. 2. — Constraints on the Milky Way merger history from the observed thick disk and local stellar age- velocity dispersion relation. Left: Number of 1:10 
mergers implied by the observed age-velocity dispers ion relation, s i nce a given redshift z. Eac h point represents the constraint from observed stellar velocity 
dispersions in the solar neighborhood (from Nordstrom et al. 2004; Seabroke & Gilmore 2007, circles and squares, respectively), at a given stellar age (here 
converted to a lookback redshift z). We convert <r 2 (age) to JVmergets assuming all heating comes from gas-free 1:10 mergers (given our analytic Equation (8)). 
Allowing gas makes the constraints more generous by a factor 1/(1 -/ gas ). We contrast Af me igers given our derivation of heating in realistic orbits (where 
Act 2 oc (Mt/M\ ) 2 ; red) and the derivation from Toth & Ostriker 1 1992) for rigid satellites on circular orbits (Act 2 oc MijM\ ; blue). Shaded range represents the 
typical factor ~ 2 scatter in normalization (see Figure [TJ. We compare the median A/mergers more major than 1:10 since redshift z ( solid black line; interquartile 
range shown as dashed lines) expected for a Milky Way-mass halo in ACDM cosmological simulations i Fakhouri & Ma 2008). Right: Same, but instead 
assuming that all the heating for each point was effected by a single merger - i.e. estimating the maximum a llowed merger since z. Solid line is the cosmological 
expectation as before, where halo mass is defined at each redshift; dot-dashed line from Stewart et al. 12007) is the expectation where the mass ratio is relative to 
the z = Milky Way mass (i.e. allowing for growth of the disk since a given redshift). Observational constraints, re-analyzed with the appropriate derivations for 
realistic orbits, easily accommodate the expected merger histories of Milky Way-like galaxies (allowing ~ 5 — 10 individual 1:10 mergers). 



dominated systems. In Hopki ns"et alj (|2008) we demonstrate 
that a mean relationship holds between the amount of classical 
bulge mass formed (violently relaxed and formed in a dissi- 
pational starburst owing to loss of angular momentum in the 
cold gaseous disks) and mass ratio, orbital parameters, and 
pre-merger gas content of merging galaxies. For otherwise 
fixed parameters at the time of merger, the amount of bulge 
formed scales roughly linearly in the mass ratio ~ fi, for the 
reasons briefly discussed in § l2.1l above. Correspondingly, the 
amount of disk heating scales according to our estimates, as a 
function of merger mass ratio and gas content. 

A proper quantitative calculation of e.g. the relation be- 
tween bulge-to-disk ratios and disk scale heights must account 
for the details mentioned above - including the evolution of 
merger history as a function of mass ratio and the correspond- 
ing evolution in gas fractions (which affect both bulge-to-disk 
ratios and disk scale heights, but in somewhat different man- 
ners). Moreover, these derivations apply only to systems in a 
post-merger status - more properly we would need to include 
the effects of accretion of new gas subsequent to each event, 
building up disk mass and making the disk more thin. Any 
quantitative estimates therefore require a much more com- 
plete cosmological model than we have here. 

However, we can highlight some of the salient qualita- 
tive behaviors that differ between our derived disk heat- 
ing expectation s for r ealistic orbits and the derivation of 
iToth & Ostrikerl d!992l) . by adopting a simplified toy model. 
Consider a set of disks where the bulge was entirely formed by 
the most recent merger, of mass ratio \i, and ignore subsequent 
accretion of new disks. The bulge mass fracti on will then be 
psjvwhere in detail we use the formulae from Hopkins et al. 
(2008) for a median expected orbit (averaging over random 
orbits) and small pre-merger disk gas fraction (< 10-20%). 



The surviving disk will have a thickness determined 
by Equations Q-dSll, or the corresponding equations from 
IToth & OstrjkeT (fl992) in the strict limit of rigid satellites 
on circular orbits - again we assume / gas <C 1 for conve- 



nience and use the exact normalizations from § 12.31 which 
agree reasonably well with full numerical experiments. We 
assume a disk height H/Rd w 0.1 for isolated systems, 
compar able to the expectations for purely internal h eating 
(see e.g. lBinnev & Tremaindfl987t [Sellwoo d et al.lll998h and 
isolated disk galaxy simulations ( Villalobos & Helmi 120081 : 
Youn geret al.l l2008). These details are important only for the 
exact numerical quantities in the profiles - the behavior of in- 
terest is that, for the cases we derive here, we roughly obtain 
H/R ~ (B/T) 2 (plus some constant; since AH/R oc /j, 2 and 
B/T oc fi), whereas for the IToth & Ostriken (Il992h case we 
have H/R ~ (B/T) (both cx fi). The qualitative relationship 
between H/R and B/T is different. 

Assuming the initial disk is Milky Way-like (giving it the 
appropriate size and mass), and giving the bulge the appropri- 
ate proj ected effect i ve rad ius from the spheroid size-mass re- 
lation in S hen et all (|2003) for its mass (knowing B/T), we can 
then construct a mock two-dimensional profile of the system. 
For simplicity, assu me that the "bulge" is spherically symmet- 
ric with a projected de Vaucouleurs ( 1948) profile and the disk 
is azimuthally symmetric with an exponential surface bright- 
ness profile and vertical sech 2 profile (with the estimated 
H/R). Note that it makes no difference to our qualitative 
comparisons whether we adopt e.g. a pure exponential ver- 
tical p r ofile instead, a s suggested by Barteldrees & Dettmari 
( 1994); lde~Griis et alj ( 119971) . Integrating through these mock 
profiles, we can construct the edge-on "surface brightness" 
map of the total system, and show this in Fi gure [3] For both 
our estimate of disk heating here, and that in Toth & O striker 
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FIG. 3. — Qualitative (toy model) expectations for the difference in edge-on morphologies of disk+bulge systems (with the given bulge-to-total stellar mass 
ratio B/T) given disk heating according to our derivation for realistic orbits (bottom) and that for rigid satellites on circular obits (top). In t he toy model, the bulg e 
fraction traces the mass ratio of the most recent (assumed gas-poor) merger, according to the calibration from numerical experiments in Hopkins et al. 12008); 
we then calculate the amount of disk heating given this merger, and plot the combined, projected edge-on surface brightness profile of the system. Because disk 
heating is much more efficient in the rigid satellite, circular orbit regime, disks in intermediate B/T systems are much thicker than observed - if heights scaled as 
predicted they would "swamp" the bulge and give rise not to a bulge plus disk morphology but to a series of decreasingly elliptical "boxy" (rectangular) isophotes. 
In contrast, our prediction for relatively inefficient heating means the disks even in intermediate B/T systems can be relatively thin (H/R < 0.2), leading to a 
standard bulge plus disk edge-on morphology and the characteristic observed "disky" isophotes in large bulges with embedded disks. 

this heating (and correspondingly the induced increase in disk 
scale height and vertical velocity dispersion) scale linearly in 
the mass ratio, oc M2 /M\ . 

This fundamental premise has largely informed the litera- 
ture on the subject of disk heating and empirical constraints 
on merger histories from the thickness and vertical profiles 
of observed disks. However, we show that it is no longer 
the case when the simplifying assumptions are relaxed (as in 
cur rent, more detailed simulations and cosmological models; 
e.g. [Velazquez & White]! 1999tlArdi et alj|2003fc Benson et alj 
l2004tlHavashi & Chibal2006tlK'azantzidis et al l2007l) . These 
assumptions are fine when one considers a system that is ef- 
fectively a point particle at large radius in an extended (but 
relatively low-density) background halo, on a moderately cir- 
cular orbit, such that there are no close passages of the satellite 
and primary and the entire regime of the orbit has a charac- 
teristic frequency much larger than the internal characteristic 
orbital frequency of either the primary or secondary - in the 
regime where the satellite is far from the galaxy and has just 
been accreted near the halo virial radius, for example. And 
in a strictly instantaneous sense, they are not bad. But in a 
global sense, in terms of estimating the effects on a galac- 
tic disk from a merger with initial mass ratio M1/M2, these 
assumptions break down, and the result is not just a numeri- 
cal correction, but a qualitative change in the character of the 
heating. 

In modern ACDM cosmologies, orbits tend to be highly ra- 
dial, especially those orbits that actually pass near or through 
the disk. Circular orbits efficiently lose angular momentum 
to interactions and other substructures in halo formation, and 
even when they survive, the dynamical friction times required 
for the orbit to decay from near the virial radius are so large 
(of order the Hubble time) that they tend to survive far away 
from the central galaxy rather than merge with and/or heat 
the disk. Moreover, once the galaxy gets sufficiently close to 
the disk that it might effect some heating, the characteristic 
orbital frequencies are (by necessity) similar to the character- 
istic internal frequencies of the disk and halo. Because some 
of the galaxy will be in near-resonance with the secondary 
orbit, this allows much more efficient transfer of angular mo- 
mentum from the orbital motion into internal motions, and 
even initially circular orbits become radial in just one or two 



d 1992b . we construct this toy model image for cases with net 
values B/T = 0.1-1. 

The difference in galaxy shapes is immediately apparent. 
Again, the quantitative details will, in reality, reflect a much 
more complex history of the system. Here, we focus on two 
qualitative d etails. First, if heating were as efficient as the 
derivation of lToth & Ostriker] dl992) implies, there are essen- 
tially no very thin disks, even in systems with low B/T. This 
is similar to what we have found in § 13. II above - the amount 
of heating for some merger history is, for realistic orbits, 
much less. Second, at intermediate B/T typical of early-type 
spirals and SO galaxies, the derivation herein implies (owing 
to the low efficiency of disk heating) that disks, while clearly 
thicker, are still thinner than the bulge itself, giving galaxies a 
reasonably defined disk plus bulge or disk plus spheroid edge- 
on morphology, comparable to what is canonically observed 
along the Hubble sequence. On the other hand, if heating were 
linear in /i, then it would more or less track B/T, giving thick 
disks (for B/T ~ 0.3-0.5, this gives disk H/R ~ 0.3-0.5, 
comparable to what is typically associated with galaxy bulges 
and extended spheroids, not "disks" in normal observational 
parlance). In fact, this is sufficient to largely suppress the 
"disk plus bulge" morphology entirely - the Hubble sequence 
and changing bulge to disk ratios do not appear as such when 
viewed edge on, but rather (visually) would appear to be pri- 
marily a sequence in disk thickness. 

4. DISCUSSION AND CONCLUSIONS 

We have shown that allowing for realistic satellite orbits - 
in particular the fact that orbits are not exactly circular, and 
that satellites tend to be increasingly stripped as they execute 
multiple passages through a disk - has dramatic implications 
for the efficiency and cosmological evolution of disk "heat- 
ing" via minor mergers and galaxy-galaxy in teractions. 

In previous generations of analyses (e.g. iToth & O striker 
fl992h . several assumptions were made implicitly: that a satel- 
lite would enter a disk on a slowly-decaying circular orbit ow- 
ing to dynamical friction, and that the satellite was rigid. In 
addition, much of this work focused on dissipationless (gas- 
free) cases. Under these conditions, it is straightforward to 
derive the fractional disk heating owing to a merger of a satel- 
lite of mass M2 with a primary of mass Mi , and to show that 
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passages. Because the orbital angular momentum is trans- 
ferred quickly in a passage, the secondary cannot gently "spi- 
ral in" but must return from apocenter in a radial "plunge" 
- the secondary effectively free-falls inwards (weakly decel- 
erated by e.g. dynamical friction at this point) and "punches 
through" the primary, where it executes a rapidly damping os- 
cillation through the primary on nearly radial orbits. At this 
stage, violent relaxation scatters stars off the time-dependent 
potential and transfers the orbital energy to the central stellar 
distribution (the bulge or whatever the hottest stars are at the 
center of the galaxy), damping the orbit and completing co- 
alescence in a couple of dynamical times. At the same time, 
once the secondary begins to penetrate the primary disk, it 
experiences rapid tidal mass loss and soon most of the mass 
has been tidally added to the extended halo and cannot further 
heat the disk. 

For these reasons, the most useful approximation for disk 
heating in realistic cosmological situations is not a gradually 
decaying circular orbit of a rigid satellite, but rather some- 
thing like the impact approximation: the heating is dominated 
by a single or a couple of radial oscillations at the initial mass, 
on a radial free-fall trajectory. We show that in this case, the 
heating is not linear in the mass ratio Mo/Mi. Rather, the ef- 
fective heating is second-order in the mass ratio, cx (M^jMx) 1 . 
We show that - again for the reasons above - this is a much 
more robust scaling. 

We demonstrate this by comparing our analytic predictions 
to a variety of numerical simulations, which include a range 
in secondary mass ratios, assumed structural properties of the 
disk, halo, bulge, and satellite, gas content, redshift, orbital 
parameters, and orbital anisotropy and angular momentum 
content. Remarkably, these different simulations, from var- 
ious independent groups, all agree with our analytic predicted 
scalings and with each other (when properly normalized) to 
within a factor ^2-3. It is straightforward to show that a lin- 
ear scaling in mass ratio is not a good description of the sim- 
ulation results (even if the normalization of the linear scaling 
is free - i.e. the scaling itself must be second-order). 

This provides a physical reason why simulations increas- 
ingly find, contrary to historical expectations, that sys- 
tems can undergo cosmologically expected merger histories 
and still yield relativ ely thin disks at z = in line with 
observations (see e.g. IVelazquez & White! [19991; lArdi et alJ 
2003 1 ; lAbadi et alj|2003t iBenson et al]|2004l IRobertson et all 
2004 I2006t lOkamoto et all 120051; iHayashi & Chibal 120061; 
Governato et al. 2007; Kazantzidis et al. 2007; Younger et al. 
2008t[Villalobos & Helmi 200$). Many of these studies found 



that idealized numerical exp eriments yield less effic ient disk 
heating than that predicted bv lToth & Ostrikerid 19921) ; the key 
contribution of our work is to outline the physical reasons for 
this difference and to demonstrate that these differences are 
not a matter of e.g. the normalization or numerical prefactors 
involved in an estimate of disk heating: heating is second- 
order in mass ratio (/i). 

This qualitatively changes the disk heating paradigm. Be- 
cause the mass spectrum of mergers (in terms of halo mass) 
goes as dN /dln/i cx fT 1 , if the heating per encounter were in 
fact linear (cx /.t), then there would be more or less equal con- 
tributions to disk heating and thick disk formation from each 
logarithmic interval in mass ratio. Observationally, this would 
imply that no more than a few dwarf galaxies with properties 
similar to those of e.g. the rather s mall Sagittarius dwarf (see 
Helmi 2004; Fellhauer et al. 2006) could have been accreted 



or destroyed by the Milky Way in the last lOGyr, in conflict 
with cosmological expectations and making the well-known 
"missing satellite" problem even more difficult to resolve. 

In contrast, if the heating is second-order in /i, then the net 
heating is dominated by the few large mergers of relatively 
large mass ratios. This behavior has in fact been recognized 
in cosmological sim ulations, and yiel d s better agreement with 
observations (see lAbadi et alJ 120031 ; IRobertson et al.l 1200' 



Benson et a il 12004 lOkamoto et al] I2005F iGovernato et 



20071: iKazantzidis et alj|2007t iRead et alj|2008l) . It also fun 
damentally enables the existence of very thin disks: since 
mergers of relatively large mass ratios /j, > 0.1 are rare, they 
are somewhat stochastic - i.e. some systems which have 
gone sufficient time or been sufficiently gas rich since their 
last ^1:10 merger will exist and can therefore have very 
thin disks (H/R < 0. 1) similar to the "ultra-thin" or "super- 
thin" disks observed (Seth et al. 20 051 lYoachim & Dalcantonl 
2006). If instead the same net heating came from mergers of 
mass ratios /j, ~ 0.001 -0.01 (where the expected number of 
mergers since z ~ 1 is of order several hundred per galaxy; 
making the existence of any galaxies without such mergers in 
their recent past an improbable event) as from mergers with 
mass ratio fj, ~ 0.1 — 1, then it would be almost impossible 
to contrive realistic cosmological merger histories that could 
give rise to ultra-thin disks. 

There are a number of other indirect implications of this re- 
sult. For example, we demonstrate that this is of significant 
importance for the edge-on morphologies of disk plus bulge 
systems. Given some bulge formation in mergers, whether 
the induced heating in the surviving disk goes linearly or 
second-order in /i produces a different relation between scale 
heights and bulge-to-disk ratios and, correspondingly, differ- 
ent morphologies in observed early-type disks. If realistic 
scale heights increased as efficiently as expected in the cir- 
cular orbit, rigid secondary approximation, then disks of in- 
termediate B/T would necessarily be thicker than observed 
- in fact, the Hubble sequence, viewed edge-on, would no 
longer have a characteristic "bulge plus disk" morphology and 
would instead appear as a sequence in ellipticity of character- 
istically boxy isophotal shapes (because a sufficiently thick 
disk appears in edge-on projection as somewhat rectangular). 
In contrast, disks appear to remain thin even in systems with 
significant (or dominant) bulges, giving rise to characteristi- 
cally "disky" ("lemon-shaped") isophotal shapes. The non- 
linearity of disk heating can explain why galactic morpholog- 
ical components form bimodal families (bulges and disks), as 
opposed to a sequence of increasingly "spheroidal" disks. 

Applying these recalibrated models to the Milky Way and 
other massive disk galaxies, the implied constraints on the 
merger history from the thick and thin disk are dramatically 
changed and fully consistent with cosmologically expected 
merger histories. In detail, applying our scaling, calibrated 
against numerical simulations, to the observed age-velocity 
dispersion relation in the solar neighborhood, implies that the 
Milky Way could have survived as many as ~ 5 — 10 minor 
(mass ratio ^ 1:10) mergers in the last lOGyr. If some of 
the disk thickness owes to other processes or there were some 
more violent events, this number might be a bit smaller, but if 
the Milky Way were more gas-rich in the past, then this num- 
ber would in fact go up by a factor 1 /(l — / gas ). In either case, 
the median number of such mergers expected for a Milky- 
Way mass halo is only ~ 3 (with ~ 90% of Milky- Way like 
systems having < 5-6 such mergers in this interval). The ob- 
served vertical velocity dispersions of stars (and implied disk 
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thickness) of the Milky Way is completely consistent with the 
cosmologically expected amount of minor merging in the last 
lOGyr, and in fact these constraints alone permit the Milky 
Way, far from being especially quiescent, to be in the upper 
cosmological quartile in terms of number of minor mergers 
since z ~ 1-2. Translating these constraints to a "maximum" 
mass ratio merger implies that the Milky Way could even have 
survived mergers of mass ratio ~ 1:4-1:3 since z ~ 2, without 
producing too much heating or thick disk. 

Because the heating is second order in mass ratio, there 
is almost no measurable heating effected by encounters with 
mass ratios /i <C 0.1, so there is no overall constraint based 
on the thickness of the disk or internal velocity dispersions on 
how much total mass has been added to the Milky Way via 
m erging. In contrast to the < 5% growth since z ~ 2 inferred 
bv lTofh & Ostrikeri dl992l) . we find that the Milky Way could 
have accumulated up to ~ 25 - 50% of its present mass in the 
last lOGyr by 1:10 mergers alone (~ 5— 10 individual 1:10 
mergers). More interesting, because again smaller mass ratio 
mergers heat only to second order in /i, the Milky Way could 
easily, in fact, have grown by factors of several since z ~ 2, 
provided this growth mostly came from relatively small mass 
ratio mergers /i < 1:10. This is critical for our understanding 
of galaxy formation: at some level of sufficiently small mass 
ratios below ~ 1:10-1:100, the distinction between accretion 
and mergers is increasingly blurred. Essentially all galaxies 
are expected to gain a large fraction of their mass and grow by 
large factors (up to an order of magnitude) in the last ~ 1 Gyr 
from such accretion events. If these small events heated as ef- 
ficiently as major events (e.g. if ~ 30 separate 1:100 mergers 
were as damaging to a disk as a 1:3 merger), there would be a 
fundamental difficulty for CDM models. In fact, owing to the 
inefficiency of disk heating, there is no tension here. 

This is not, of course, to say that problems of disk for- 
mation and survival in a ACDM Universe have been solved. 
There are still well-known problems related to the loss of an- 
gular momentum in disks (owing in part to over-merging), 
that tend to yield (in cosmological simulations) disks that 
are too concentrated and have too much bulge relative to 
observations. A gr eat deal of theoretical effort has gone 
into studying this (Weil et all 119981: ISommer- Larsen et al.l 
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Okam oto et al.ll2.005t IScannapieco et alj|2008b . with increas- 
ing agreement that some combination of improved resolution 
and proper implementation of feedback from supernovae and 
stellar winds are critical for the formation of realistic disks. 
These issues are not addressed or affected by our analysis 
here. 

However, it is increasingly clear (as resolution improves 
such that thick disks can be properly resolved in the verti- 
cal direction) that these simulations do not generally have the 
problem of producing disks that are much too thick relative 
to observations. In this paper, we have shown that there is a 
good physical reason for this, and that it is not a peculiarity of 
a couple of simulations or of an especially quiescent merger 
history, but rather is a robust and quite general expectation 
even for systems with violent merger histories and even in sys- 
tems that will be bulge-dominated at z = 0. We have derived 
a new, more generally applicable scaling of disk height with 
merger mass ratio, and shown that it qualitatively changes the 
expectations for how disk heating should proceed in a ACDM 
universe, and generically leads to the expectation that disks 
are quite robust to being heated by minor mergers. Although 
there are still considerable improvements in models needed to 
explain when and how disks form and why the distribution of 
bulge-to-disk ratios is what is observed at z = 0, there is no 
tension between the models and the heating and thickness of 
disks. In short, we have argued that there is no "disk heating" 
problem in ACDM cosmologies, once a proper accounting of 
disk heating for realistic cosmological orbits and conditions 
is considered. 
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